Depth maps captured with commodity sensors often require super-resolution before they can be used in applications. In this work, we study a super-resolution approach based on a variational problem statement with Tikhonov regularization, where the regularizer is parameterized by a deep neural network. This approach was previously applied successfully in photoacoustic tomography. We experimentally show that its application to depth map super-resolution is difficult, and suggest reasons for that.
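As a hedged sketch of the formulation referred to here (notation assumed, not taken from the abstract), the Tikhonov-regularized variational problem reads

$$\hat{x} = \arg\min_{x} \; \|Ax - y\|_2^2 + \lambda\, R_\theta(x),$$

where $y$ is the observed low-resolution depth map, $A$ is the degradation (downsampling) operator, $R_\theta$ is a regularizer parameterized by a deep neural network with weights $\theta$, and $\lambda > 0$ balances data fidelity against regularization.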
Depth maps captured with commodity sensors are often of low quality and resolution; these maps need to be enhanced to be used in many applications. State-of-the-art data-driven methods for depth map super-resolution rely on registered pairs of low- and high-resolution depth maps of the same scenes. Acquiring real-world paired data requires specialized setups. The alternative, generating low-resolution maps from high-resolution maps by subsampling, adding noise, and other artificial degradations, does not fully capture the characteristics of real-world low-resolution images. As a result, supervised learning methods trained on such artificial paired data may perform poorly on real-world low-resolution inputs. We consider an approach to depth super-resolution based on learning from unpaired data. While many techniques for unpaired image-to-image translation have been proposed, most fail to provide effective hole-filling or to reconstruct accurate surfaces from depth maps. We propose an unpaired learning method for depth super-resolution, based on a learnable degradation model, an enhancement component, and surface normal estimates as features, to produce more accurate depth maps. We propose a benchmark for unpaired depth SR and demonstrate that our method outperforms existing unpaired methods and performs on par with paired ones.
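The abstract uses surface normal estimates as features; below is a minimal NumPy sketch of how normals are commonly derived from a depth map via finite differences. This is a standard approximation, not necessarily the paper's exact procedure.

```python
import numpy as np

def depth_to_normals(depth: np.ndarray) -> np.ndarray:
    """Estimate per-pixel surface normals from a depth map
    using finite differences (a common approximation)."""
    dz_dy, dz_dx = np.gradient(depth)
    # The (unnormalized) normal is proportional to (-dz/dx, -dz/dy, 1).
    normals = np.dstack([-dz_dx, -dz_dy, np.ones_like(depth)])
    norm = np.linalg.norm(normals, axis=2, keepdims=True)
    return normals / np.clip(norm, 1e-8, None)
```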
Correct scoring of a driver's risk is of great significance to auto insurance companies. While the current tools used in this field have proven in practice to be quite efficient and beneficial, we argue that there is still much room for development and improvement in the auto insurance risk estimation process. To this end, we develop a framework that combines a neural network with the dimensionality reduction technique t-SNE (t-distributed stochastic neighbour embedding). This enables us to visually represent the complex structure of the risk as a two-dimensional surface, while still preserving the local structure of the feature space. The obtained results, which are based on real insurance data, reveal a clear contrast between high- and low-risk policyholders, and indeed improve upon the actual risk estimation performed by the insurer. Due to the visual accessibility of the portfolio in this approach, we argue that this framework could be advantageous to the auto insurer, both as a main risk prediction tool and as an additional validation stage in other approaches.
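A minimal sketch of the visualization idea, assuming scikit-learn and placeholder data: fit a neural network to score risk, then embed the feature space into two dimensions with t-SNE and color the embedding by the predicted risk.

```python
import numpy as np
from sklearn.manifold import TSNE
from sklearn.neural_network import MLPClassifier

X = np.random.rand(500, 20)          # hypothetical policy features
y = np.random.randint(0, 2, 500)     # hypothetical claim indicator

clf = MLPClassifier(hidden_layer_sizes=(32,), max_iter=500).fit(X, y)
risk = clf.predict_proba(X)[:, 1]    # per-policy risk score

# Project the feature space to a 2D surface, preserving local structure.
embedding = TSNE(n_components=2, perplexity=30).fit_transform(X)
# Plotting `embedding` colored by `risk` contrasts high/low-risk holders.
```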
With recent advances in CNNs, exceptional improvements have been made in semantic segmentation of high-resolution images in terms of accuracy and latency. However, challenges remain in detecting objects in crowded scenes, under large scale variations, partial occlusion, and distortions, while still maintaining mobility and low latency. We introduce a fast and efficient convolutional neural network, ASBU-Net, for semantic segmentation of high-resolution images that addresses these problems and uses no novel layers, for ease of quantization and embedded hardware support. ASBU-Net is based on a new feature extraction module, the atrous space bender layer (ASBL), which is efficient in terms of computation and memory. ASBL layers form the building block used to construct ASBU-Net. Since this network does not use any special layers, it can be easily implemented, quantized, and deployed on FPGAs and other hardware with limited memory. We present experiments on resource and accuracy trade-offs and show strong performance compared to other popular models.
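The ASBL design itself is not detailed in this abstract; as a stand-in, here is a generic atrous (dilated) convolution block in PyTorch of the kind such feature extractors are typically built from. All names here are hypothetical.

```python
import torch
import torch.nn as nn

class AtrousBlock(nn.Module):
    """Generic dilated-convolution building block (illustrative only)."""
    def __init__(self, channels: int, dilation: int):
        super().__init__()
        # A dilated 3x3 conv with matching padding keeps spatial size
        # while enlarging the receptive field at no extra parameter cost.
        self.conv = nn.Conv2d(channels, channels, kernel_size=3,
                              padding=dilation, dilation=dilation)
        self.bn = nn.BatchNorm2d(channels)
        self.act = nn.ReLU(inplace=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.act(self.bn(self.conv(x)))
```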
MD4 and MD5 are seminal cryptographic hash functions proposed in the early 1990s. MD4 consists of 48 steps and produces a 128-bit hash given a message of arbitrary finite size. MD5 is a more secure 64-step extension of MD4. Both MD4 and MD5 are vulnerable to practical collision attacks, yet it is still not realistic to invert them, i.e., to find a message given a hash. In 2007, the 39-step version of MD4 was inverted by reduction to SAT and applying a CDCL solver along with the so-called Dobbertin's constraints. As for MD5, in 2012 its 28-step version was inverted via a CDCL solver for one specified hash without adding any additional constraints. In this study, Cube-and-Conquer (a combination of CDCL and lookahead) is applied to invert step-reduced versions of MD4 and MD5. For this purpose, two algorithms are proposed. The first one generates inversion problems for MD4 by gradually modifying the Dobbertin's constraints. The second algorithm tries the cubing phase of Cube-and-Conquer with different cutoff thresholds to find the one with the minimal runtime estimate for the conquer phase. This algorithm operates in two modes: (i) estimating the hardness of an arbitrary given formula; (ii) incomplete SAT solving of a given satisfiable formula. While the first algorithm is focused on inverting step-reduced MD4, the second one is not area-specific and so is applicable to a variety of classes of hard SAT instances. In this study, for the first time in history, 40-, 41-, 42-, and 43-step MD4 are inverted via the first algorithm and the estimating mode of the second algorithm. 28-step MD5 is inverted for four hashes via the incomplete SAT-solving mode of the second algorithm. For three of these hashes, this is done for the first time.
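A hedged sketch of the second algorithm's cutoff search as described above: try several cubing cutoff thresholds and keep the one whose estimated conquer-phase runtime, extrapolated from a random sample of cubes, is minimal. `run_cubing` and `solve_cube` are hypothetical stand-ins for invocations of a lookahead solver and a CDCL solver.

```python
import random

def estimate_best_cutoff(formula, cutoffs, sample_size=100):
    """Pick the cubing cutoff with minimal estimated conquer runtime."""
    best = None
    for n in cutoffs:
        cubes = run_cubing(formula, cutoff=n)       # lookahead (cubing) phase
        sample = random.sample(cubes, min(sample_size, len(cubes)))
        mean_t = sum(solve_cube(formula, c) for c in sample) / len(sample)
        estimate = mean_t * len(cubes)              # extrapolate to all cubes
        if best is None or estimate < best[1]:
            best = (n, estimate)
    return best  # (cutoff, estimated total conquer runtime)
```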
Accurate mapping of forests is critical for forest management and carbon stock monitoring. Deep learning is becoming more popular in Earth Observation (EO); however, the availability of reference data limits its potential in wide-area forest mapping. To overcome those limitations, here we introduce contrastive regression into EO-based forest mapping and develop a novel semi-supervised regression framework for wall-to-wall mapping of continuous forest variables. It combines a supervised contrastive regression loss and a semi-supervised Cross-Pseudo Regression loss. The framework is demonstrated over a boreal forest site using Copernicus Sentinel-1 and Sentinel-2 imagery for mapping forest tree height. The achieved prediction accuracies are considerably better than those of a vanilla UNet or traditional regression models, with a relative RMSE of 15.1% at the stand level. We expect that the developed framework can be used for modeling other forest variables and EO datasets.
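As a hedged reading of the abstract, the combined training objective can be written as

$$\mathcal{L} = \mathcal{L}_{\text{reg}} + \alpha\,\mathcal{L}_{\text{contrast}} + \beta\,\mathcal{L}_{\text{CPR}},$$

where $\mathcal{L}_{\text{reg}}$ is a standard supervised regression loss, $\mathcal{L}_{\text{contrast}}$ is the supervised contrastive regression loss, $\mathcal{L}_{\text{CPR}}$ is the semi-supervised Cross-Pseudo Regression loss on unlabeled pixels, and $\alpha, \beta$ are assumed weighting coefficients not specified in the abstract.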
Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.
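BLOOM checkpoints are published on the Hugging Face Hub; the following is a minimal usage sketch with the `transformers` library, using the small `bigscience/bloom-560m` variant (the full 176B model requires multi-GPU hosting).

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load a small BLOOM variant for local experimentation.
tokenizer = AutoTokenizer.from_pretrained("bigscience/bloom-560m")
model = AutoModelForCausalLM.from_pretrained("bigscience/bloom-560m")

inputs = tokenizer("The capital of France is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=10)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```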
We address the problem of automatic clinical caption generation with a model that combines frontal X-ray scans with structured patient information from radiology records. We combine two language models, Show-Attend-Tell and GPT-3, to generate comprehensive and descriptive radiology records. The proposed combination of these models produces a textual summary containing the pathologies found and their locations, along with 2D heatmaps that localize each pathology on the original X-ray scans. The proposed model is tested on two medical datasets, Open-I and MIMIC-CXR, and on the general-purpose MS-COCO dataset. The results, measured with natural language assessment metrics, demonstrate its effective applicability to chest X-ray image captioning.
Most methods for estimating functional connectivity of the brain from functional magnetic resonance imaging (fMRI) data rely on computing some measure of statistical dependence, or more generally similarity, between univariate representative time series of regions of interest (ROIs) consisting of multiple voxels. However, summarizing the multiple time series of an ROI by its mean or first principal component (1PC) may result in a loss of information: for example, the 1PC explains only a small fraction of the variance of the multivariate signal of neuronal activity. We propose to compare ROIs directly, without using representative time series, and define a new multivariate measure of connectivity between ROIs, which need not consist of the same number of voxels, based on the Wasserstein distance. We evaluate the proposed Wasserstein functional connectivity measure on an autism screening task, demonstrating its superiority over commonly used univariate and multivariate functional connectivity measures.
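As an illustration of the idea (not the paper's exact definition), each ROI can be treated as an empirical distribution of voxel time series and compared via optimal transport; the sketch below assumes the POT library (`pip install pot`).

```python
import numpy as np
import ot  # Python Optimal Transport

def roi_wasserstein(roi_a: np.ndarray, roi_b: np.ndarray) -> float:
    """Wasserstein-style distance between two ROIs.
    roi_a: (n_voxels_a, n_timepoints), roi_b: (n_voxels_b, n_timepoints);
    the ROIs may contain different numbers of voxels."""
    a = np.full(roi_a.shape[0], 1.0 / roi_a.shape[0])  # uniform voxel weights
    b = np.full(roi_b.shape[0], 1.0 / roi_b.shape[0])
    M = ot.dist(roi_a, roi_b)      # pairwise squared Euclidean costs
    return ot.emd2(a, b, M)        # exact optimal-transport cost
```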
Variational inference with Gaussian mixture models (GMMs) enables learning highly tractable yet multimodal approximations of intractable target distributions. GMMs are particularly relevant for problem settings with up to a few hundred dimensions, for example in robotics, for modelling trajectory distributions or joint distributions. This work focuses on two very effective methods for GMM-based variational inference that both employ independent natural gradient updates for the individual components and for the categorical distribution of the weights. We show for the first time that, although their practical implementations and theoretical guarantees differ, their derived updates are equivalent. We identify several design choices that distinguish the two approaches, namely with respect to sample selection, natural gradient estimation, stepsize adaptation, and whether trust regions are enforced or the number of components is adapted. We perform extensive ablations on these design choices and show that they strongly affect the efficiency of the optimization and the variability of the learned distribution. Based on our insights, we propose a novel instantiation of the generalized framework that combines first-order natural gradient estimates with trust regions and component adaptation, and significantly outperforms both previous methods in all our experiments.
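For orientation, a hedged sketch of the underlying objective (standard GMM-based variational inference, with notation assumed here): fit $q_\theta(x) = \sum_o w_o\, \mathcal{N}(x;\, \mu_o, \Sigma_o)$ by maximizing the evidence lower bound

$$J(\theta) = \mathbb{E}_{q_\theta}\big[\log \tilde{p}(x)\big] + \mathcal{H}(q_\theta),$$

where $\tilde{p}$ is the unnormalized target density and $\mathcal{H}$ the entropy. The two methods discussed above differ in how the natural gradient of $J$ with respect to each component's $(\mu_o, \Sigma_o)$ and the weights $w_o$ is estimated and stepped.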